NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Fairquant: Certifying and Quantifying Fairness of Deep Neural Networks

https://doi.org/10.1109/ICSE55347.2025.00016

Kim, Brian Hyeongseok; Wang, Jingbo; Wang, Chao (April 2025, IEEE)

Free, publicly-accessible full text available April 26, 2026
An Incremental Algorithm for Algebraic Program Analysis

https://doi.org/10.1145/3704901

Zhou, Chenyu; Fang, Yuzhou; Wang, Jingbo; Wang, Chao (January 2025, Proceedings of the ACM on Programming Languages)

We propose a method for conducting algebraic program analysis (APA) incrementally in response to changes of the program under analysis. APA is a program analysis paradigm that consists of two distinct steps: computing a path expression that succinctly summarizes the set of program paths of interest, and interpreting the path expression using a properly-defined semantic algebra to obtain program properties of interest. In this context, the goal of an incremental algorithm is to reduce the analysis time by leveraging the intermediate results computed before the program changes. We have made two main contributions. First, we propose a data structure for efficiently representing path expression as a tree together with a tree-based interpreting method. Second, we propose techniques for efficiently updating the program properties in response to changes of the path expression. We have implemented our method and evaluated it on thirteen Java applications from the DaCapo benchmark suite. The experimental results show that both our method for incrementally computing path expression and our method for incrementally interpreting path expression are effective in speeding up the analysis. Compared to the baseline APA and two state-of-the-art APA methods, the speedup of our method ranges from 160X to 4761X depending on the types of program analyses performed.
more » « less
Full Text Available
Cyber Attack Detection in Electric Vehicle Charging Stations Using Topological Data Aided Learning

https://doi.org/10.1109/PESGM52009.2025.11225416

Jacob, Roshni Anna; Uddin, Md Joshem; Wang, Jingbo; Coskunuzer, Baris; Zhang, Jie (July 2025, IEEE)

Free, publicly-accessible full text available July 27, 2026
A numerical method for simulating variable density flows in membrane desalination systems

https://doi.org/10.1016/j.compfluid.2024.106449

Municchi, Federico; Liu, Yiming; Wang, Jingbo; Cath, Tzahi Y; Turchi, Craig S; Heeley, Michael B; Hoek, Eric MV; Jassby, David; Tilton, Nils (December 2024, Computers & Fluids)

We present a novel method for simulating unsteady, variable density, fluid flows in membrane desalination systems. By assuming the density varies only with concentration and temperature, the scheme decouples the solution of the governing equations into two sequential blocks. The first solves the governing equations for the temperature and concentration fields, which are used to compute all thermophysical properties. The second block solves the conservation of mass and momentum equations for the velocity and pressure. We show that this is computationally more efficient than schemes that iterate over the full coupled equations in one block. We verify that the method achieves second-order spatial–temporal accuracy, and we use the method to investigate buoyancy-driven convection in a desalination process called vacuum membrane distillation. Specifically, we show that with gravity properly oriented, variations in temperature and concentration can trigger a double-diffusive instability that enhances mixing and improves water recovery. We also show that the instability can be strengthened by providing external heating.
more » « less
Full Text Available
Generating buoyancy-driven convection in membrane distillation

https://doi.org/10.1016/j.memsci.2024.123043

Mabry, Miles; Municchi, Federico; Liu, Yiming; Wang, Jingbo; Cath, Tzahi Y; Turchi, Craig S; Heeley, Michael B; Hoek, Eric MV; Jassby, David; Martinand, Denis; et al (September 2024, Journal of Membrane Science)
NA (Ed.)
Membrane distillation (MD) is a thermally-driven desalination process that can treat hypersaline brines. Considerable MD literature has focused on mitigating temperature and concentration polarization. This literature largely neglects that temperature and concentration polarization increase the feed density near the membrane. With gravity properly oriented, this increase in density could trigger buoyancy-driven convection and increase permeate production. Convection could also be strengthened by heating the feed channel wall opposite the membrane. To investigate that possibility, we perform a series of experiments using a plate-and-frame direct contact MD system with an active membrane area of 300 cm2 and a feed channel wall heated using a resistive heater. The experiments measure the average transmembrane permeate flux for two gravitational orientations, feed Reynolds numbers between 128 and 1128, and wall heat fluxes up to 12 kW/m2. The results confirm that with gravity properly oriented, wall-heating can trigger buoyancy-driven convection for a wide range of feed Reynolds numbers, and increase permeate production between roughly 20 and 130 %. We estimate, however, that at high Reynolds numbers (𝑅𝑒 > 800), more than 70 % of the wall heat is carried out of the MD system by the feed flow, without contributing to permeate production. This suggests the need for longer membranes and heat recovery steps in any future practical implementation.
more » « less
Full Text Available
Certifying the Fairness of KNN in the Presence of Dataset Bias

https://doi.org/10.1007/978-3-031-37703-7_16

Li, Yannan; Wang, Jingbo; Wang, Chao (July 2023, Springer)

We propose a method for certifying the fairness of the classification result of a widely used supervised learning algorithm, the k-nearest neighbors (KNN), under the assumption that the training data may have historical bias caused by systematic mislabeling of samples from a protected minority group. To the best of our knowledge, this is the first certification method for KNN based on three variants of the fairness definition: individual fairness, ϵ -fairness, and label-flipping fairness. We first define the fairness certification problem for KNN and then propose sound approximations of the complex arithmetic computations used in the state-of-the-art KNN algorithm. This is meant to lift the computation results from the concrete domain to an abstract domain, to reduce the computational cost. We show effectiveness of this abstract interpretation based technique through experimental evaluation on six datasets widely used in the fairness research literature. We also show that the method is accurate enough to obtain fairness certifications for a large number of test inputs, despite the presence of historical bias in the datasets.
more » « less
Full Text Available
Systematic Testing of the Data-Poisoning Robustness of KNN

https://doi.org/10.1145/3597926.3598129

Li, Yannan; Wang, Jingbo; Wang, Chao (July 2023, ACM)

Data poisoning aims to compromise a machine learning based software component by contaminating its training set to change its prediction results for test inputs. Existing methods for deciding data-poisoning robustness have either poor accuracy or long running time and, more importantly, they can only certify some of the truly-robust cases, but remain inconclusive when certification fails. In other words, they cannot falsify the truly-non-robust cases. To overcome this limitation, we propose a systematic testing based method, which can falsify as well as certify data-poisoning robustness for a widely used supervised-learning technique named k-nearest neighbors (KNN). Our method is faster and more accurate than the baseline enumeration method, due to a novel over-approximate analysis in the abstract domain, to quickly narrow down the search space, and systematic testing in the concrete domain, to find the actual violations. We have evaluated our method on a set of supervised-learning datasets. Our results show that the method significantly outperforms state-of-the-art techniques, and can decide data-poisoning robustness of KNN prediction results for most of the test inputs.
more » « less
Full Text Available
Synthesizing MILP Constraints for Efficient and Robust Optimization

https://doi.org/10.1145/3591298

Wang, Jingbo; Gupta, Aarti; Wang, Chao (June 2023, Proceedings of the ACM on Programming Languages)

While mixed integer linear programming (MILP) solvers are routinely used to solve a wide range of important science and engineering problems, it remains a challenging task for end users to write correct and efficient MILP constraints, especially for problems specified using the inherently non-linear Boolean logic operations. To overcome this challenge, we propose a syntax guided synthesis (SyGuS) method capable of generating high-quality MILP constraints from the specifications expressed using arbitrary combinations of Boolean logic operations. At the center of our method is an extensible domain specification language (DSL) whose expressiveness may be improved by adding new integer variables as decision variables, together with an iterative procedure for synthesizing linear constraints from non-linear Boolean logic operations using these integer variables. To make the synthesis method efficient, we also propose an over-approximation technique for soundly proving the correctness of the synthesized linear constraints, and an under-approximation technique for safely pruning away the incorrect constraints. We have implemented and evaluated the method on a wide range of benchmark specifications from statistics, machine learning, and data science applications. The experimental results show that the method is efficient in handling these benchmarks, and the quality of the synthesized MILP constraints is close to, or higher than, that of manually-written constraints in terms of both compactness and solving time.
more » « less
Full Text Available
Learning to Synthesize Relational Invariants

https://doi.org/10.1145/3551349.3556942

Wang, Jingbo; Wang, Chao (October 2022, IEEE/ACM International Conference on Automated Software Engineering)

Full Text Available
Synthesizing Fair Decision Trees via Iterative Constraint Solving

Wang, Jingbo; Li, Yannan; Wang, Chao (August 2022, International Conference on Computer Aided Verification)

Full Text Available

« Prev Next »

Search for: All records